Maithili Text to Speech Corpus


Owner: Central Institute of Indian Languages

Catalogue Number: 1515

Stock: In Stock


Tags: Maithili; Text to Speech; TTS Corpus


Dataset Description

30:59:20 hours | 19.56 GB | 32260 Audio Segments | 2 Speakers

The LDC-IL Maithili Text to Speech dataset comprises audio files in WAV format, accompanied by a corresponding textual layer in Devanagari script. The dataset spans a duration of 32:42:20 (hh:mm:ss) and consists of read speech recorded in a studio setup. The data comes from two native Maithili speakers, one female and one male. A comprehensive explanation of the dataset can be found in the Maithili Text to Speech Documentation.
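As a rough sanity check on the headline figures, the sketch below converts the 30:59:20 duration from the summary line into seconds and derives the mean segment length and the average data rate. It assumes that "GB" means 10**9 bytes and that the listed size is almost entirely audio; the function and variable names are illustrative and are not part of the LDC-IL documentation.

```python
def hms_to_seconds(hms: str) -> int:
    """Convert an 'hh:mm:ss' string to a total number of seconds."""
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return hours * 3600 + minutes * 60 + seconds


# Figures copied from the catalogue summary line.
TOTAL_DURATION = "30:59:20"
NUM_SEGMENTS = 32260
SIZE_GB = 19.56  # assumption: 1 GB = 10**9 bytes

total_seconds = hms_to_seconds(TOTAL_DURATION)
mean_segment_seconds = total_seconds / NUM_SEGMENTS
bytes_per_second = SIZE_GB * 10**9 / total_seconds

print(f"total seconds: {total_seconds}")
print(f"mean segment length: {mean_segment_seconds:.2f} s")
print(f"average data rate: {bytes_per_second / 1000:.0f} kB/s")
```

The mean segment length gives a feel for the typical utterance size, which can help when planning batch sizes or filtering very short or very long clips before training a TTS model.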

For research citations, please cite the following:

Shantanu Kumar, Dinesh Mishra, Saurabh Varik, Stephen Fernandes, Nithin S., Roopashri M. R., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan. 2025. Maithili Text to Speech Corpus. Central Institute of Indian Languages, Mysore. 978-93-48633-36-1.
Rejitha K. S. and Narayan Kumar Choudhary. (ed.). 2025. LDC-IL Corpus Insights. Central Institute of Indian Languages, Mysore. 978-93-48633-33-0.

Item specifics

Authors: Shantanu Kumar, Dinesh Mishra, Saurabh Varik, Stephen Fernandes, Nithin S., Roopashri M. R., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan
Corpus Type: TTS Corpus
Catalogue Number: 1515
ISBN: 978-93-48633-36-1
Data Source: Studio
Duration: 30:59:20 (hh:mm:ss)
Number of Audio Segments: 32260
Release Date: 20/03/2025
Terms and Conditions: General instructions for use of the resources provided by LDC-IL.
